Hierarchical Contrast for Unsupervised Skeleton-Based Action Representation Learning

نویسندگان

چکیده

This paper targets unsupervised skeleton-based action representation learning and proposes a new Hierarchical Contrast (HiCo) framework. Different from the existing contrastive-based solutions that typically represent an input skeleton sequence into instance-level features perform contrast holistically, our proposed HiCo represents multiple-level performs in hierarchical manner. Specifically, given human sequence, we it multiple feature vectors of different granularities both temporal spatial domains via sequence-to-sequence (S2S) encoders unified downsampling modules. Besides, is conducted terms four levels: instance level, domain clip part level. Moreover, orthogonal to S2S encoder, which allows us flexibly embrace state-of-the-art encoders. Extensive experiments on datasets, i.e., NTU-60, NTU-120, PKU-I PKU-II, show achieves for two downstream tasks including recognition retrieval, its learned good transferability. also framework effective semi-supervised recognition. Our code available at https://github.com/HuiGuanLab/HiCo.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unsupervised learning of an embodied representation for action selection

We propose a principle on how a computational agent can learn the structure of a classic discrete state space. The idea is to do a kind of principal component analysis on a matrix describing transitions from one state to another. This transforms the space of discrete, completely separate, states into a dimensional representation in a Euclidean space. The representation supports action selection...

متن کامل

Deep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning

Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...

متن کامل

Discourse-Based Objectives for Fast Unsupervised Sentence Representation Learning

This work presents a novel objective function for the unsupervised training of neural network sentence encoders. It exploits signals from paragraph-level discourse coherence to train these models to understand text. Our objective is purely discriminative, allowing us to train models many times faster than was possible under prior methods, and it yields models which perform well in extrinsic eva...

متن کامل

Unsupervised Learning aids Prediction: Using Future Representation Learning Variantial Autoencoder for Human Action Prediction

The unsupervised Pretraining method has been widely used in aiding human action recognition. However, existing methods focus on reconstructing the already present frames rather than generating frames which happen in future.In this paper, We propose an improved Variantial Autoencoder model to extract the features with a high connection to the coming scenarios, also known as Predictive Learning. ...

متن کامل

A Hierarchical Representation for Future Action Prediction

We consider inferring the future actions of people from a still image or a short video clip. Predicting future actions before they are actually executed is a critical ingredient for enabling us to effectively interact with other humans on a daily basis. However, challenges are two fold: First, we need to capture the subtle details inherent in human movements that may imply a future action; seco...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i1.25127